What Is New in Our City? A Framework for Event Extraction Using Social Media Posts

نویسندگان

  • Chaolun Xia
  • Jun Hu
  • Yan Zhu
  • Mor Naaman
چکیده

Post streams from public social media platforms such as Instagram and Twitter have become precious but noisy data sources to discover what is happening around us. In this paper, we focus on the problem of detecting and presenting local events in real time using social media content. We propose a novel framework for real-time city event detection and extraction. The proposed framework first applies bursty detection to discover candidate event signals from Instagram and Twitter post streams. Then it integrates the two posts streams to extract features for candidate event signals and classifies them into true events or noise. For the true events, the framework extracts various information to summarize and present them. We also propose a novel method that combines text, image and geolocation information to retrieve relevant photos for detected events. Through the experiments on a large dataset, we show that integrating Instagram and Twitter post streams can improve event detection accuracy, and properly combining text, image and geolocation information is able to retrieve more relevant photos for events. Through case studies, we also show that the framework is able to report detected events with low spatial and temporal deviation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Emergency Event Detection in Twitter Streams Based on Natural Language Processing

Real-time social media usage is widely adapted today because it encourages quick spreading of news within social networks. New opportunities arise to use social media feeds to detect emergencies and extract crucial information about that event to support rescue operations. A major challenge for the extraction of emergency event information from applications like Twitter is the big mass of data,...

متن کامل

Disaster Analysis using User-Generated Weather Report

Information extraction from user-generated text has gained much attention with the growth of the Web. Disaster analysis using information from social media provides valuable, real-time, geolocated information for helping people caught up these in disasters. However, it is challenging to analyze texts posted on social media because disaster keywords match any texts that contain words. For collec...

متن کامل

Sequential Event Detection Using Multimodal Data in Nonstationary Environments

The problem of sequential detection of anomalies in multimodal data is considered. The objective is to observe physical sensor data from CCTV cameras, and social media data from Twitter and Instagram to detect anomalous behaviors or events. Data from each modality is transformed to discrete time count data by using an artificial neural network to obtain counts of objects in CCTV images and by c...

متن کامل

Accurate Local Estimation of Geo-Coordinates for Social Media Posts

Associating geo-coordinates with the content of social media posts can enhance many existing applications and services and enable a host of new ones. Unfortunately, a majority of social media posts are not tagged with geocoordinates. Even when location data is available, it may be inaccurate, very broad or sometimes fictitious. Contemporary location estimation approaches based on analyzing the ...

متن کامل

Information Extraction for Social Media

The rapid growth in IT in the last two decades has led to a growth in the amount of information available online. A new style for sharing information is social media. Social media is a continuously instantly updated source of information. In this position paper, we propose a framework for Information Extraction (IE) from unstructured user generated contents on social media. The framework propos...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015